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Human Telomerase 
INTRODUCTION 

Field of the In<"fflTiftn 

The field of this invention is an enzyme involved in cell replication. 

Background 

DNA at chromosome ends is maintained in a dynamic balance of loss and addition 
of telomeric simple sequence repeats. Sequence loss occurs during cell replication, in pan 
from incomplete replication of chromosome termini by DNA-dependent DNA polymerase. 
Telomeric repeat addition is catalyzed by the enzyme telomerase: a ribonucleoprotein 
enzyme which uses a short region within the RNA as a template for the polymerase 
reaction. Although cells can maintain a constant number of telomeric repeats by balancing 
repeat loss and addition, not all cells do so. Human germline and cancer cells maintain a 
constant number of telomeric repeats, while norma! human somatic cells lose telomeric 
repeats with each cycle of cell division. Cells which do not maintain stable telomere length 
demonstrate a limited proliferative capacity: these cells senesce after a number of 
population doublings correlated with the erosion of telomeres to a critical minimum length. 

Because normal somatic cells do not appear to express or require telomerase and do 
not maintain chromosome ends, and because all or almost all cancer ceils express high 
levels of telomerase activity and maintain chromosome ends, molecules that inhibit or alter 
lelomerase activity could provide effective and non-toxic anticancer agents. Similarly, 
inhibition of telomerase in parasitic or infectious agents (e.g. trypanosomes. fungi, etc.) 
could provide a specific approach for reducing the viability or proliferation of these agents. 

1 
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Conv^v, aoiva.™ of ttlammx jn ^^.^ ^ ^ ^ 
son, a „c cei ,s bteod , vascolrare _ |iw ^ eK ; coum ^ a raK|]MjOT ^ 
promoting additional proliferative lifespan. 

Relevant T.it^,^ 

Purification of teiomerasc ftom the ciliate Tetrahymena and cloning of genes 
cncc^ng two protein components of thcenzyn.eis^^^^, gj 
677-686 and copending US patent application No. 08/359.125, filed 19 DEC 1994 
^eratorerelatingtohu^te^^^^ . (1994) Science 266, 201 ,-201 4- 
-^--.a^S--^^. Literaturerelatingtotelornerasetc.pUte ' 
modtficafcons allude Autexier et al (,994, Genes and Deve. 8. 563-575; Yu et al (1991) 

Cell 67, 823-832; and Yuetal. (1990) Nature 344 I2M32 TheW« h - r, 
w^.-.-^. ' U6 -"2. The Washmgton University- 

Merck EST Project contains an EST, reportedly deposited by Hiilier et al. on Nov I , 995 
whichhasscquencesinu^^^ ^ ' 

general review, see Blackburn et al.. Eds. (1995) Telomeres, Cold Spring Harbor 
'5 Laboratory Press. 
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SUMMARY OF THE INVENTION 
The mvention provide methods and common, relating t0 a human telomere 
andrelatednucleicacids. ^vcto*^^^^ 

calledpl40.p,05,M8andp43andte,ome raS e Pro teindom^ 

Reactivity. Ttop^^top^nc-^^,,..^^^ 

^^"ttetomcr^ ^ 

thereo£ d ^ k^* 0 te ' ome ' M e RNA components, as well as specific, functional derivatives 

The invention provides isolated, telomere hybridization probes and primers 
capable ofspecific^y hybridizing wim me disclosed telomerase gen, telomcrase-^ecific 
bmdmg agents such as specific antibodies, and methods of making and using the subject 
composmons in diagnosis (e.g. genetic hybridization screens for telomere transcripts) 
therapy (e.g. gene therapy to modulate telomere gene expression) and in the 
btopharmaceutical industry (e.g. reagents for screening chemical libraries for lead 
pharmacological agents and nucleic acid polymerase reagents). 



25 
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SEQ ID LISTING 
SEQ ID NO.i: pl05 protein (amino acid sequence) 
SEQ ID NO:2: pi 05 ambiguity maximized synthetic DNA 
SEQ ID NO:3: pl05 natural cDNA (the coding region is bp 97-2370) 
SEQ ID NO:4: pi 05 E. coli optimized synthetic DNA 
SEQ ID NO:5: pi 05 mammalian optimized synthetic DNA 
SEQ ID NO:6: telomerase RNA 

SEQ ID NO:7: telomerase RNA template region modification 1 

SEQ ID NO:8: telomerase RNA template region modification 2 

SEQ ID NO:9: telomerase RNA template region modification 3 

SEQ ID NO: 10 p43 peptide (XXXEAAT[I/L]D(I/L]PQQGANK, where the three X's arc 

indeterminant residues) 



DETAILED DESCRIPTION OF THE INVENTION 
The invention provides isolated human telomerase proteins including human 
15 telomerase proteins pl40, pl05, p48 and p43, having molecular weights of about 140kD, 
about !05kD, about 48kD and about 43kD, respectively, as determined by polyacrylamide 
gel electrophoresis under denaturing conditions (Matsudaira and Burgess (1978) Anal 
Biochem 87, 386-396), and telomerase protein domains thereof. The telomerase proteins 
comprise assay-discemable functional domains including RNA recognition motifs and 
20 subunit binding domains and may he provided as fusion products, e.g with non-telomerase 
polypeptides. The human telomerase proteins of the invention, including the subject protein 
domains, all have telomerase-specific activity or function. 

Tctomerase-specific activity or function may be determined by convenient in vitro. 
cell-based, or in vivo assays: e.g. in vitro binding assays, cell culture assays, in animals (e.g. 
25 immune response, gene therapy, transgenics, etc.), etc. Binding assays encompass any 
assay where the molecular interaction of a telomerase protein with a binding target is 
evaluated The binding target may be a natural intracellular binding target such as a 
telomerase subunit (e.g. another protein subunit or RNA subunit), a substrate, agonist, 
antagonist, chaperone. or other regulator that directly modulates telomerase activity or its 
30 localization; or non-natural binding target such a specific immune protein such as an 

antibody, or a telomerase specific agent such as those identified in assays described below. 
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Generally, telomere- binding specificity is assayed by telomere polymerase activity (see. 
e.g. Collins et al. 1995, Cell 81, 677-686). by binding equilibrium constants (usually at least 
about 10 ? M-'. preferably at least about 10* M'\ more preferably at least about 10' M"). by 
the ability of the subject protein to function as negative mutants in tclomerase-expressing 
cells, to elicit telomere specific antibody in a heterologous host (e.g a rodent or rabbit), 
5 etc. In any event, the telomerase binding specificity of the subject telomere proteins 
necessarily distinguishes ciliate telomerase, preferably distinguishes non-mammalian 
telomeres and more preferably distinguishes non-human telomeres. Exemplary 
telomerase proteins which are shown to have telomerase binding specificity include the 
telomerase RNA (e.g. SEQ ID NO:6) binding domains (e.g. RRM 1 -4: SEQ ID NO: 1 , about 
10 residues 5-81, residues 115-192. residues 336-420. and residues 487-578, respectively). 

telomerase primer binding domains, nucleotide triphosphate binding domains and binding 
domains of regulators of telomerase such as nuclear localization proteins, etc. As used 
herein, a protein domain comprises at least 12, preferably at least about 20. more preferably 
at least about 40, most preferably at least about 80 residues of the disclosed respective SEQ 
15 ID NO. 

The claimed human telomerase proteins are isolated or pure: an "isolated" protein is 
unaccompanied by at least some of the material with which it is associated in its natural 
state, preferably constituting at least about 0.5%, and more preferably at least about 5% by 
weight of the total protein in a given sample and a pure protein constitutes at least about 

10 90%, and preferably at least about 99% by weight of the total protein in a given sample. 

The telomerase proteins and protein domains may be synthesized, produced by recombinant 
technology, or purified from human cells. A wide variety of molecular and biochemical 
methods are available for biochemical synthesis, molecular expression and purification of 
the subject compositions, see e.g. Molecular Cloning, A Laboratory Manual (Sambrook. et 

5 al. Cold Spring Harbor Laboratory), Current Protocols in Molecular Biology (Eds. Ausubel. 
et al., Greene Publ. Assoc., Wiley-Interscience, NY) or that are otherwise known in the art. 
An exemplary method for isolating each of human telomerase protein P 140, pl05. p48 and 
p43 from human cells is as follows: 

Several thousand (two to twelve thousand) liters of HeLa cell* are grown in spinner 

) culture. The cells are lysed by dounce homogenization in low-salt buffer to produce crude 
cell lysates. The lysates are supplemented with 15% glycerol and centrifuged at 125.000 x 



10 
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S for 50 minutes to obtain a first soluble fraction enriched for telomerase activity (S-100 
fraction). The S-100 fraction is adjusted to 0.2 M ionium sulfate, bound to SP 
Sepharose 

(Pharmacia), and developed with a gradient in sodium chloride, to obtain a second soluble 
fraction enriched for telomerase (SP fraction). The SP fraction is adjusted to about 0.3-0.4 
5 M ionic strength and bound to Q Sepharose (Pharmacia), and developed with a gradient in 
sodium chloride, to obtain a third soluble fraction enriched for telomerase (Q fraction). The 
Q fraction is adjusted to about 0.3-0.4 M ionic strength, bound to phosphocelluiose 
(Whatman), and developed with sodium chloride, to obtain a fourth soluble fraction 
enriched for telomerase (PC fraction). The PC fraction is adjusted to about 0.3-0.4 M ionic 
strength, bound to 2'Omethyt RNA oligonucleotide immobilized on streptavidin agarose 
(Sigma), and eluted with a electrophoresis sample medium comprising 5% p. 
mercaptoethanol and 2% Sodium Dodecyl Sulfate to obtain a fifth soluble fraction 
(2'Omethyl fraction). The 2'Omethyl fraction is separated by polyacrylamide gel 
electrophoresis under denaturing conditions (Matsudaira and Burgess (1978) Anal Biochem 
1 5 87, 386-396) to obtain gel protein bands at a molecular weight of about 1 40kD, 1 05kD, 
48kD or 43kD having telomerase activity. The gel bands are excised or blotted to obtain 
purified human telomerase proteins pl40, pl05, p48 and p43. 

The subject telomerase proteins find a wide variety of uses including use in 
isolating, enriching for and concentrating telomerase RNA and telomerase proteins, as 
20 immunogens, in the methods and applications described below, as reagents in the 

biotechnology industries, and in therapy. Recombinant telomerases are used in many 
applications where nascent oligonucleotides of predetermined sequence are desired. For 
example, native nucleic acid molecules are labeled or extended at their 3' ends by addition 
of a predetermined repeat sequence (for double-stranded oligonucleotides, both ends of the 
>5 molecule may be tagged). Oligonucleotides complementary to the repeat are then used to 
amplify, sequence, affinity purify, etc. the nucleic acid molecules. The use ota repeat 
sequence for 3' end tagging improves specificity and provides sequence alternatives 
compared with non-templated enzymes presently available for this purpose, e.g. terminal 
transferase. Repeats encoding restrict™ enzyme sites provide repeat tagging to facilitate 
cloning and the use of telomerase alleviates the restrictive conditions required for optimal 
ligation with available iigase enzymes. Telomerase also finds use in regulating cell growth 

5 
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or increasmg cell density tolerance; for example, cells contacted with an effective amount of 
exogenous telomerase to overcome the growth control limitation otherwise imposed by 
short telomere length. Telomere may be introduced, expressed, or repressed in specific 
populations of cells by any convenient way such as microinjection, promoter-specific 
expression of recombinant enzyme, targeted delivery of lipid vesicles, etc. Advantageously, 
only a brief period of telomere activity is required to allow many generations of continued 
proliferation of the contacted ceil, due to the ability of telomerase to extend telomeres in 
one cell cycle by more sequence than is lost with each cell division. 

The invention provides natural and non-natural human telomerase-speciftc binding 
agents including substrates, agonist, antagonist, etc.. methods of identifying and making 
such agents, and their use in diagnosis, therapy and pharmaceutical development. For 
example, human telomerase-specific agents are useful in a variety of diagnostic and 
therapeutic applications. Novel human telomerase-specific binding agents include human 
telomerase-specific receptors, such as somatically recombined protein receptors like 
specific antibodies or T-cell antigen receptors (see, e.g Harlow and Lane (1988) Antibodies, 
A Laboratory Manual Cold Spring Harbor Laboratory) and other natural intracellular 
binding agents identified with assays such as one-, two- and three-hybrid screens, non- 
natural intracellular binding agents identified in screens of chemical libraries such as 
described below, etc. Agents of particular interest modulate human telomerase function, 
e.g. human telomerase antagonists and find use methods for modulating the binding of a 
human telomerase or telomerase protein to a human telomerase binding target. 

For diagostic uses, the binding agents are frequently labeled, such as with 
fluorescent, radioactive, chcmiluminescent, or other easily detectable molecules, either 
conjugated directly to the binding agent or conjugated to a probe specific for the binding 
agent. Binding agents also find use in modulating the telomerase activity present in a cell. 
For example, isolated cells, whole tissues, or individuals may be treated with a telomerase 
binding agent to activate, inhibit, or alter the specificity of telomerase assembly, 
localization, substrate interaction, or synthesis activity. Effectively treated cells have 
increased or decreased replication potential, or suffer from loss of proper telomere structure 
(resulting in lethality). These binding agents also find therapeutic use to control cell 
proliferation; for example, the uncontrolled growth of transformed cells (e.g. cancer cells) is 
managed by administration to the cells or patient comprising such ceils of a telomerase 

6 
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binding agent which reduces telomere activity. In contrast to many current 
chemotherapies, the present invention provides enhanced specificity of lethality, with 
minimum tonicity to dividing yet normal somatic cells. 

The amino acid sequences of the disclosed telomerase proteins are used to back- 
translate telomerase protein-encoding nucleic acids optimized for selected expression 
5 systems (Holler et al. (1993) Gene 136, 323-328; Martin et al. (1995) Gene 154, 150-166) 
or used to generate degenerate oligonucleotide primers and probes for use in the isolation of 
natural telomerase encoding nucleic acid sequences ("GCG" software, Genetics Computer 
Group, Inc. Madison WT). As examples. SEQ ID NO:2 discloses an ambiguiry-maxirnized 
p!05 coding sequence encompassing all possible nucleic acids encoding the full-length 
pl05 protein. SEQ ID NO:3 discloses a natural human cDNA sequence encoding pl05, 
SEQ ID NO:4 is a pl05 coding sequence codon-optimized for E. coli, SEQ ID NO:5 is a 
pi 05 coding sequence codon optimized for mammalian cell expression. Telomerase 
encoding nucleic acids may be part of human telomerase-expression vectors and may be 
incorporated into recombinant host cells, e.g. for expression and screening, transgenic 
1 5 animals, e.g. for functional studies such as the efficacy of candidate drugs for disease 

associated with human telomerase-mcdiated signal transduction, etc. Expression systems 
are selected and/or tailored to effect human telomerase protein structural and functional 
variants through alternative post-trans lan'onal processing. 

The invention also provides nucleic acid hybridization probes and 
replication/amplification primers having a human telomerase cDNA specific sequence 
contained in SEQ ID NO:3. bases 1 -2345, and sufficient to effect specific hybridization 
thereto (i.e. specifically hybridize with SEQ ID NO:3, bases 1-2345 in the presence of 
natural ciliate telomerase cDNA, preferably in the presence of non-mammalian telomerase 
cDNA and more preferably, in the presence of murine telomerase cDNA). Demonstrating 
25 specific hybridization generally requires stringent conditions, for example, hybridizing in a 
buffer comprising 30% formamide in 5 x SSPE (0.18 M NaCl. 0.01 M NaP0 4f pH7.7, 0.001 
M EDTA) buffer at a temperature of 42 T and remaining bound when subject to washing at 
42°C with 0.2 x SSPE; preferably hybridizing in a buffer comprising 50% formamide in 5 x 
SSPE buffer at a temperature of 42 : C and remaining bound when subject to washing at 
30 42°C with 0.2 x SSPE buffer at 42 : C. Human telomerase cDNA homologs can also be 

distinguished from other protein using alignment algorithms, such as BLASTX (Altschul ei 
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al. (1990) Basic Local Alignment Search Tool, J Mol Biol 215. 403-410). 

The invention also provides non-namral sequence, recombinant and isolated natural 
sequence human telomere RNA. Natural human telomere RNA sequences include the 
nuclei acid disclosed as SEQ ID NO:6, or a fragment thereof sufficient to specifically 
hybnduewithanucleic add having the sequence defined by SEQIDNO* in the presence 
of a nucletc acid having the sequence disclosed in Feng et al. 1995. Science 269. 1236- 
1241. Such fragments necessarily distinguish the previously described (Feng et al 1995 
S CI ence269. 1 236- 1 24 1 ) human RNA species. Preferred such fragments comprise SEo'lD 
NO:6. bases 191-210. ba*s 245-239. bases 341-369 or bases 381-399. Non-natural 
sequences mclude derivatives and/or mutations of SEQ ID NO:6. where such 
derivatives/mutation, provide action in template, protein binding, or other regions to 
effect altered telomere substrate specificity or altered reaction product (e.g any 
predetermined sequence), etc, see, e.g. Autexier et al.. 1994, Genes & Develop 8 563-575- 
Collins et al. (1995) EMBO J. 14, 5422-5432; Greider et al. (1995) Structure and ' 
B,ochemistryofCil^ 

Ed^ColdSpringHarborLabonttoryPre*. Additional derivatives function as dominant 
negaave fragments which effectively compete for telomere assembly. For examples 
SEQ E> NO:7, Sand 9 are derivatives which provide for modified substrate specificity and 
polymerase reaction product to interfere with cellular function (see, e.g. Hanish et al. (1994) 
Proc Natl Acad Sci 91, 8861-8865). 

The subject nucleic acids are of synthetic/non-natural sequences and/or are isolated 
..c. unaccompanied by at least some of the material with which it is associated in its natural' 
state, preferably constituting at «e«t about 0.5'* preferably at least about 5% by weight of 
total nucleic acid present in . given fraction, and usual.y recombinant, meaning thev 
compnsean^n-natu^seq^oranatural sequence joined to nucleotides) other'than 
that which it is joined to on a namra, chromosome. Nucleic acids comprising the nucleotide 
sequence of SEQ ID NO:3 or fragments thereof, contain such sequence or fragment at a 
termmus. immediately flanked by a sequence other than that which it is joined to on a 
natural chromo«,me, or flanked by a native flanking region fewer than 10 kb. preferably 
fewer than 2 kb, which is immediately flanked by a sequence other than that which it is 
jo.ned to on a natural chromosome. While the nucleic acids arc usually RNA or DNA. it is 

often advantageous to use nucleic acids comr»n«n 0 „,k— u , . , 

Ias cor *pnsing other bases or nucleotide analogs to 

8 
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provide modified stability, etc. The subject nucleic acids find a wide variety of applications 
including use as translatable transcripts, hybridization probes, PCR primers, diagnostic 
nucleic acids, etc.; use in detecting the presence of human telomerase genes and gene 
transcripts and in detecting or amplifying nucleic acids encoding additional human 
telomerase homologs and structural analogs. 

In diagnosis, human telomerase hybridization probes find use in identifying wild- 
type and mutant human telomerase alleles in clinical and laboratory samples. Mutant 
alleles are used to generate allelc-specific oligonucleotide (ASO) probes for high- 
throughput clinical diagnoses. In therapy, therapeutic human telomerase nucleic acids are 
used to modulate cellular expression or intracellular concentration or availability of active 
telomerase. A wide variety of indications may be treated, either prophylactically or 
therapeutically with the subject compositions. For example, where limitation of cell growth 
is desired, eg. neoproliferative disease, a reduction in telomerase expression is effected by 
introducing into the targeted cell type human telomerase nucleic acids which reduce the 
functional expression of human telomerase gene products (e.g. nucleic acids capable of 
1 5 inhibiting translation of a functional telomerase transcript). Conditions for treatment 
include various cancers, where any of a wide variety of cell types may be involved, 
restenosis, where vascular smooth muscle cells are involved, inflammatory disease states, 
where endothelial cells, inflammatory cells and glomerular cells are involved, myocardial 
infarction, where heart muscle cells are involved, glomerular nephritis, where kidney cells 
20 are involved, transplant rejection where endothelial cells are involved, infectious diseases 
such as HIV infection where certain immune cells and other infected cells are involved, or 
the like. 

Telomerase inhibitory nucleic acids are typically antisense: single-stranded 
sequences comprising complements of the disclosed natural telomerase coding sequences. 

25 Antisense modulation of the expression of a given telomerase protein may employ 

telomerase antisense nucleic acids operably linked to gene regulatory sequences. Cell arc 
transfected with a vector comprising a human telomerase sequence with a promoter 
sequence oriented such that transcription of the gene yields an antisense transcript capable 
of binding to endogenous human telomerase protein encoding mRNA. Transcription of the 

30 antisense nucleic acid may be constitutive or inducible and the vector may provide for 
stable extrachromosomal maintenance or integration. Alternatively, single-stranded 
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antisense nucleic acids that bind to genomic DNA or mRNA encoding a given human 
telomerase protein may be administered to the target cell, in or temporarily isolated from a 
host, at a concentration that results in a substantial reduction in expression of the targeted 
protein. 

In other indications, e.g. certain hypersensitivities, atrophic diseases, etc.. an 
5 increase in cell growth or proliferation is desired. In these applications, an enhancement in 

human telomerase expression is effected by introducing into the targeted cell type human 
telomerase nucleic acids which increase the functional expression of human telomerase 
gene products. Conditions for treatment include multiple sclerosis, where certain neuronal 
cells are involved, inflammatory disease states such as rheumatoid arthritis, where 

10 bystander cells are involved, transplant rejection where graft cells are involved, infectious 

diseases such as HIV infection where certain uninfected host cells are involved, or the like. 
Such nucleic acids may be human telomerase expression vectors, vectors which upregulate 
the functional expression of an endogenous human telomerase allele, or replacement vectors 
for targeted correction of human telomerase mutant alleles. 

1 5 Various techniques may be employed for introducing of the nucleic acids into viable 

cells, e.g. transfection with a retrovirus, viral coat protein-liposome mediated transfection. 
The techniques vary depending upon whether one is using the subject compositions in 
culture or in vivo in a host. In some situations it is desirable to provide the nucleic acid 
source with an agent which targets the target cells, such as an antibody specific for a surface 

20 membrane protein on the target cell, a ligand for a receptor on the target cell, etc. Where 

liposomes are employed, proteins which bind to a surface membrane protein associated with 
endocytosis may be used for targeting and/or to facilitate uptake, e.g. capsid proteins or 
fragments thereof tropic for a particular cell type, antibodies for proteins which undergo 
internalization in cycling, proteins that target intracellular localization and enhance 

25 intracellular half-life. 

The invention provides methods and compositions for enhancing the yield of many 
recombinantly produced proteins by increasing maximum cell densities and survival time of 
host production cells in culture. Specifically, cultured cells are transfected with nucleic 
acids which effect the up-regulation of endogenous telomerase or the expression of an 

30 exogenous telomerase. For example, nucleic acids encoding functional human telomerase 

operably linked to a transcriptional promoter are used to over-express the exogenous 
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telomerase in the host cell. Telomerase-expressing cells demonstrate enhanced survival 
ability at elevated cell densities and over extended culture periods. 

The invention provides efficient methods of identifying agents, compounds or lead 
compounds for agents active at the level of a human telomcrase modulatable cellular 
function. Generally, these screening methods involve assaying for compounds which 
modulate human telomerase interaction with a natural human telomerase binding target. A 
wide variety of assays for binding agents are provided including labeled in vitro telomere 
polymerase assays, protein-protein binding assays, immunoassays, cell based assays, etc. 
The metho^ji^ aijjei^it Jo^uJomAtei- cost-effective h'^h tu^nu^t ~r 



10 



15 



20 



25 



30 



chemical libraries for lead compounds. Identified reagents find use in the pharmaceutical 
industries for animal and human trials; for example, the reagents may be derivatized and 
rescreened in in vitro and in vivo assays to optimize activity and minimize toxicity for 
pharmaceutical development. Target indications may include infection, genetic disease, cell 
growth and regulatory disflmction, such as neoplasia, inflammation, hypersensitivity, etc. 
Target cells also include progenitor cells for repopulating blood or bone marrow, tissue 
grafts, and tissue subject to degredation/high turnover such as digestive and vascular 
endothelia and pulmunary and dermal epithelia. 

In vitro binding assays employ a mixture of components including a human 
telomerase protein, which may be part of multi-subunit telomerase. a fusion product with 
another peptide or polypeptide, e.g. a tag for detection or anchoring, etc. The assay 
mixtures comprise a natural intracellular human telomerase binding target, e.g. a substrate. 
While native binding targets may be used, it is frequently prefened to use portions (e.g. 
peptides, nucleic acid fragments) thereof so long as the portion provides binding affinity 
and avidity to the subject human telomerase conveniently measurable in the assay. The 
assay mixture also comprises a candidate pharmacological agent. Candidate agents 
encompass numerous chemical classes, though typically they are organic compounds; 
preferably small organic compounds and are obtained from a wide variety of sources 
including libraries of synthetic or natural compounds. A variety of other reagents may also 
be included in the mixture. These include reagents like salts, buffers, neutral proteins, e.g. 
albumin, detergents, protease inhibitors, nuclease inhibitors, antimicrobial agents, etc. may 
be used. 

The resultant mixture is incubated under conditions whereby, but for the presence of 
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the candidate pharmacological agent, the human telomere specifically binds the cellular 
binding target, portion or analog with a reference binding affinity. The mixture components 
can be added in any order that provides for the requisite bindings and incubations may be 
performed at any temperature which facilitates optimal binding. Incubation periods are 
likewise selected for optimal binding but also minimized to facilitate rapid, high-throughput 
5 screening. 

After incubation, the agent-biased binding between the human telomerase and one or 
more binding targets is detected by any convenient way. For cell-free binding type assays, 
a separation step is often used to separate bound from unbound components. Separation 
may be effected by precipitation (e.g. TCA precipitation, immunoprecipitation. etc.), 
immobilization (e.g on a solid substrate), etc.. followed by washing by, for examples, 
membrane filtration (e.g. Whatman's P-81 ion exchange paper. Polyfiltronic's hydrophobic 
GFC membrane, etc.), gel chromatography (e.g. gel filtration, affinity, etc.). For telomere 
polymerase assays, binding is detected by a change in the polymerization by the telomerase 
of a nucleic acid or nucleic acid analog on the substrate. 

Detection may be effected in any convenient way. For cell-free binding assays, one 1 5 

of the components usually comprises or is coupled to a label. The label may provide for 
direct detection as radioactivity, luminescence, optical or electron density, etc. or indirect 
detection such as an epitope tag. an enzyme, etc. A variety of methods may be used to 
detect the label depending on the nature of the label and other assay components, e.g. 
through optical or electron density, radiative emissions, nonradiative energy transfers, etc. 
or indirectly detected with antibody conjugates, etc. 

A difference in the binding affinity of the human telomerase protein to the target in 
the absence of the agent as compared with the binding affinity in the presence of the agent 
indicates that the agent modulates the binding of the human telomerase protein to the 
human telomerase binding target Analogously, in the cell-based transcription assay also j 25 

described below, a difference in the human telomerase transcriptional induction in the 
presence and absence of an agent indicates the agent modulates human telomerase-induced 
transcription. A difference, as used herein, is statistically significant and preferably 
represents at least a 50%, more preferably at least a 90% difference. 

The following examples are offered by way of illustration and not by way of 3( 
limitation. 
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EXAMPLES 

1 . Protocol for high-throughput human telomere polymerization assay. 
A. ReagSnts: 

- Neutralite Avj^in: 20 pg/ml in PBS. 

- human telomerage: I0 a - 10' 5 M human telomerase in PBS. 

5 - Stacking buffer 5% BSA t 0.5% Tween 20 in PBS; I hour at room temperature. 

- A33SY Buffer 100 mM KCI, 20 mM HEPES pH 7.6, 1 mM MgCh, 1 mM dATP, 1 
mM dTTP, 1% glycerol, 0.5% NP-40, 50 mM BME, 1 mg/ml BSA t cocktail of protease 
inhibitors. 

-LlfElo^GTPJflaLSffick: 2 x lfr 5 M "cold" dGTP with 100 pCi [ "P]a-dGTP. 
1 0 Place in the 4°C microfridge during screening. 

* telomere subSTratg: I0" 7 - lO^M biotinylated telomerase substrate (5'-biotin- 
d(TTAGGGV3']inPBS. 

- Protease inhibitor cocktail flOQOX): 10 mg Trypsin Inhibitor (BMB # 109894), 10 
mg Aprotinin (BMB # 236624), 25 mg Benzamidinc (Sigma # B-6506), 25 mg Leupeptin 

15 (BMB # 1017128), 10 mg APMSF (BMB # 917575), and 2mM NaVo 3 (Sigma # S-6508) in 

10 ml of PBS. 

B. Preparation of assay plates: 

- Coat with 120 pi of stock N Avidin per well overnight at 4°C 

- Wash 2 times with 200 pi PBS. 

20 - Block with 150 pi of blocking buffer. 

■ Wash 2 times with 200 pi PBS. 

C. Assay; 

• Add 40 til assay buffer/well. 

- Add 40 pi human telomerase (1-1000 finoles/40 ul in assay buffer) 
25 - Add 10 pi compound or extract. 

- Add 10 pi [ ,2 P]o-dGTP lOx stock. 

- Add 40 pi biotinylated telomerase substrate (0.1-10 pmoles/40 ul in assay buffer) 

- Shake at 25 °C for 15 minutes. 

* Incubate additional 45 minutes at 25 °C. 

30 - Stop the reaction by washing 4 times with 200 pi PBS. 

• Add 150 pi scintillation cocktail. 
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- Count in Topcount. 
D. Controls for all assays (located on each plate): 

a. Non-specific binding 

b. cold dGTP at 80% inhibition. 

2. Protocol for high throughput human telomerase subunit- RNA complex formation 
assay. 

A. Reagents: 

- NeutralitC Avi(tin: 20 ug/ml in PBS. 

- Blocking buffer : 5% BSA, 0.5% Tween 20 in PBS; 1 hour at room temperature. 

- Assay Btiffffr 100 mM KC1, 20 mM HEPES pH 7.6, 1 mM MgCl,, 1% glycerol, 
0.5% NP-40, 50 mM p-mercaptoethanol, 1 mg/ml BSA, cocktail of protease inhibitors. 

" "P human telomerase ? mt^ jffr ^fr - iQr» . jo^m "cold" human telomerase 
subunit (pl05) supplemented with 200,000-250.000 cpm of labeled human telomerase 
(Bectanan counter). Place in the 4°C mkrofridge during screening. 

■ ProtMSC inhihitor cnf.ktnil nOflflTQ : 10 mg Trypsin Inhibitor (BMB # 109894), 10 
mg Aprotinin (BMB # 236624), 25 m S Bemamidine (Sigma # B-6506), 25 mg Leupeptin 
(BMB # 1017128). 10 mg APMSF (BMB # 917575), and 2mM NaVo, (Sigma # S-6508) in 
10 ml of PBS. 

- tclomcffly RNA: 10* 7 - lO^M biotinylated RNA (SEQ ID NO:6) in PBS. 
20 B. Preparation of assay plates: 

- Coat with 120 ul of stock N-Avidin per well overnight at 4*C. 

- Wash 2 times with 200 ul PBS. 

• Block with 150 ul of blocking buffer. 

- Wash 2 times with 200 ul PBS. 
25 C. Assay: 

- Add 40 al assay buffer/well. 

• Add 10 |il compound or extract. 

- Add 10 Ml »P-human telomerase protein (20,000-25.000 cpm/0. 1-10 pmoles/well 
-lO*- 10 -7 M final concentration). 

30 - Shake at 25 8 C for 1 5 minutes. 

• Incubate additional 45 minutes at 25 °C. 

14 
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- - Add 40 jal biotinylated RNA (0. 1 - 1 0 pmoIes/40 ul in assay buffer) 

- Incubate 1 hour at room temperature, 

- Stop the reaction by washing 4 times with 200 \xl PBS. 

- Add 1 SO til scintillation cocktail. 

- Count in Topcount. 

D. Controls for all assays (located on each plate): 

a. Non-specific binding 

b. Soluble (non-biotinylated telomerase) at 80% inhibition. 

All publications and patent applications cited in this specification are herein 
incorporated by reference as if each individual publication or patent application were 
specifically and individually indicated to be incorporated by reference. Although the 
foregoing invention has been described in some detail by way of illustration and example 
for purposes of clarity of understanding, it will be readily apparent to those of ordinary skill 
in the art in light of the teachings of this invention that certain changes and modifications 
may be made thereto without departing from the spirit or scope of the appended claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION : 

(I) APPLICANT: Kathleen Collins 
(ii) TITLE OF INVENTION; Human Telomerase 
Ciii) NUMBER OF SEQUENCES: 10 
5 iiv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: science & Technology Law Group 
fB) STREET: 268 Bush Street, Suite 3200 
<C) CITY: San Francisco 
(D) STATE: CA 
10 (EJ COUNTRY: USA 

(F) ZIP: 94104 
(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

15 (C) OPERATING SYSTEM: PC -DOS /MS - DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 
(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
(8) FILING DATE: 
20 (C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Osman Ph.D., Richard A 

(B) REGISTRATION NUMBER: 36,627 

(C) REFERENCE/ DOCKET NUMBER: UCB96-055 
25 (ix) TELECOMMUNICATION INFORMATION ; 

(A) TELEPHONE: (415)343-4341 

(B) TELEFAX: <41S ) 343 -4342 

(2) INFORMATION FOR SEQ ID NO:l: 
30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 759 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : 

(D) TOPOLOGY: not relevant 
35 (ii) MOLECULE TYPE: peptide 

!xi) SEQUENCE DESCRIPTION : SEQ ID NO:l: 
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Met Ala Qly Leu Thr Leu Phe Val Gly Arg Leu Pro Pro Ser Ala Arg 

15 10 is 

Ser Glu Gin Leu GIu Glu Leu Phe Ser Gin Val Gly Pro Val Lys Gin 

20 25 30 

Cys Phe Val val Thr Glu Lys Gly Ser Lys Ala Cys Arg Gly Phe Gly 

35 40 45 

Tyr Val Thr Phe Ser Met Leu Glu Asp Val Gin Arg Ala Leu Lys Glu 

50 55 SO 

lie Thr Thr Phe Glu Gly Cys Lys lie Asn Val Thr Val Ala Lys Lys 
65 70 75 80 

Lys Leu Arg Asn Lys Thr Lys Glu Lys Gly Lys Asn Glu Asn Ser Glu 

85 90 95 

Cys Pro Lys Lys Glu Pro Lys Ala Lys Lys Ala Lys Val Ala Asp Lys 

100 10S no 

Lys Ala Arg Leu He He Arg Asn Leu Ser Phe Lys Cys Ser Glu Asp 

115 120 12S 

Asp Leu Lys Thr val Phe Ala Gin Phe Gly Ala val Leu Glu val Asn 

X30 135 140 

He Pro Arg Lys Pro Asp Gly Lys Met Arg Gly Phe Gly Phe Val Gin 
145 150 155 160 

Phe Lys Asn Leu Leu Glu Ala Gly Lys Ala Leu Lys Gly Net Asn Met 

165 170 175 

Lys Glu He Lys Gly Arg Thr Val Ala Val Asp Trp Ala Val Ala Lys 

180 185 190 

Asp Lys Tyr Lys Asp Thr Gin Ser Val Ser Ala He Gly Glu Glu Lys 

195 200 205 

Ser His Glu Ser Lys His Gin Glu Ser Val Lye Lys Lys Gly Arg Glu 

210 215 220 

Glu Glu Asp Met Glu Glu Glu Glu Asn Asp Asp Asp Asp Asp Asp Asp 
225 230 233 240 

Asp Glu Glu Asp Gly Val Phe Asp Asp Glu Asp Glu Glu Glu Glu Asn 

245 250 255 

He Glu Ser Lys Val Thr Lys Pro Val Gin He Gin Lys Arg Ala Val 

260 26S 270 

Lys Arg Pro Ala Pro Ala Lys Ser Ser Asp His Ser Glu Glu Asp S«r 

275 280 285 

Asp Leu Glu Glu Ser Asp Ser He Asp Asp Gly Clu Glu Leu Ala Gin 
290 295 300 
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- Ser Asp Thr Ser Thr Glu Glu Gin Glu Asp Lys Ala Val Gin Val Ser 
305 310 31S 320 

Asn Lys Ly S Lys Arg Lys Leu Pro Ser Asp Val Asn Glu Gly Lys Thr 

32s 330 335 

Val Phe lie Arg Asn Leu Ser Phe Asp Ser Glu Glu Glu Glu Leu Gly 

340 34S 3S0 

Glu Leu Leu Gin Gin Phe Gly Glu Leu Lys Tyr Val Arg He Val Leu 



Met 



355 360 365 

His Pro Asp xhr Glu His Ser Lys Gly Cys Ala Phe Ala Gin Phe 

370 375 380 

Thr Gin Glu Ala Ala Gin Ly 3 Cys Leu Leu Ala Ala Ser Pro Glu Asn 
385 3 *° 39S 400 

Glu Ala Gly Gly Leu Lys Leu Asp Gly Arg Gin Leu Lys Val Asp Leu 

405 410 415 

Ala Val Thr Arg Asp Glu Ala Ala Lys Leu Gin Thr Thr Lys Val Lys 

«0 425 430 

Lys Pro Thr Gly Thr Arg Asn Leu Tyr Leu Ala Arg Glu Gly Leu lie 

435 440 
Arg Ala Gly Thr Lys Ala Ala Glu Gly val Ser Ala Ala Aap Met Ala 

4S0 . 455 460 

t,ys Arg Glu Arg Phe Glu Leu Leu Lys His Oln Lys Leu Lys Asp Gin 
465 470 475 480 

Asn lie Phe Val Ser Arg Thr Arg Leu Cys Leu His Asn Leu Pro Lys 

48 S 490 49S 

Ala Val Asp Asp Lye Gin Leu Arg Lya Leu Leu Leu Ser Ala Thr Ser 

300 505 510 

Gly Glu Lys Gly Val Arg lie Lys Glu Cy 9 Arg Val Met Arg Asp Leu 

SIS 520 
Lye Gly Val His Gly Asn Met Lys Gly Gin Ser Leu Gly Tyr Ala Phe 

S30 S3S 540 

Ala Glu Phe Gin Glu His Glu His Ala Leu Ly 3 Ala Leu Arg Leu lie 
545 550 S5S 560 

Asn Asn Asn Pro Glu lie Phe Gly Pro Leu Lya Arg Pro lie Val Glu 

S6S 570 575 

Phe Ser Leu Glu Asp Arg Arg Lys Leu Lys Met Lys Glu Leu Arg lie 

580 585 S90 

Gin Arg Ser Leu Gin Lys Met Arg Ser Lys Pro Ala Thr Gly Glu Pro 
59S 600 sos 
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- Gin Lys Gly Gin Pro Glu Pro Ala Lys Asp Gin Gin Gin Lys Ala Ala 
610 615 620 

Gin His His Thr Glu Glu Gin Ser Lys Val Pro Pro Glu Gin Lys Arg 
S2S 630 635 640 

Lys Ala Gly Ser Thr Ser Trp Thr Gly Phe Gin Thr Lys Ala Glu Val 

645 650 655 

Glu Gin Val Glu Leu Pro Asp Gly Lys Lys Arg Arg Lys Val Leu Ala 

660 665 670 

Leu Pro Ser His Arg Gly Pro Lys lie Arg Leu Arg Asp Lys Oly Lys 

675 680 685 

Val Lys Pro Val His Pro Lys Lys Pro Lys Pro Gin He Asn Gin Trp 

690 695 700 

Lys Gin Glu Lys Gin Gin Leu Ser ser Glu Gin val ser Arg Lys Lys 
705 710 715 720 

Ala Lys Gly Asn Lys Thr Glu Thr Arg Phe Asn Gin Leu Val Glu Gin 

725 730 735 

Tyr Lys Gin Lys Leu Leu Gly Pro Ser Lys Gly Ala Pro Leu Ala :.ys 

740 745 750 

Arg Ser Lys Trp Phe Asp Ser 
755 



(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2277 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
ATGGCNGGNY TNACNYTNTT YGTNGGNMGN YTNCCNCCNW SNGCNMGNWS NGARCARYTN 60 
GARGARYTNT TYWSNCARGT NGGNCCNGTN AARCARTGYT TYGTNGTNAC NGARAARGGN 120 
WSNAARGCNT GYMGNGGNTT YCGNTAYCTN ACNTTYWSNA TGYTNGARGA YGTNCARMGN 180 
GCNYTNAARG ARATHACNAC NTTYGARGGN TCYAARATHA AYGTNACNGT NGCMAARAAR 240 
AARYTNMONA AYAARACNAA RGARAARGGN AARAAYGARA AYWSNGARTG YCCNAARAAR 3 00 

GARCCNAARC CNAARAARGC NAARGTNGCN GAYAARAARG CNMGNYTNAT HATHMGNAAV 360 
YTNWSNTTYA ARTGYWSNGA RGAYGAYYTN AARACNCTNT TYGCNCARTT YGGNGCNGTK 420 
YTNGARGTNA AYATHCCNMG NAARCCNGAY GGNAARATGM GNGGNTTYGG NTTYGTNCAR 480 
TTYAARAAYY TNYTNGAROC NGGNAARGCN YTNAARGCNA TGAAYATGAA RGARATHAAR 540 
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GSNMGNACNG 
GTNWSNGCNA 
AARGGNWGNG 
GAYGARGARG 
GTNACNAARC 
WSNGAYCAYW 
GARYTNGCNC 
AAYAARAARA 
AAYYTNWSNT 
YTNAARTAYG 
GCNCARTTYA 
GARGCNGGNG 
GAYGARGCNG 
TAYYTNGCNM 
GCNGAYATGG 
AAYATHTTYG 
AARCARYTOM 
GARTGYMGNG 
CGNTAYGCNT 
AAYAAYAAYC 
GAYMGNMCNA 
WSNAARCCNG 
CARAARGCNG 
AARGCNGGNW 
YTNCCNGAYG 
ATHMGNYTNM 
ATHAAYCART 
GCNAARGGNA 
YTNYTNGGNC 
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TNGCNGTNGA 
THGGNGARGA 
ARGARGARCA 
AYGGNGTNTT 
CNGTNCARAT 
SNGARGARGA 
ARWSNGAYAC 
ARMGNAARYT 
TYGAYWSNGA 
TOMGNATHGT 
TGACNCARGA 
GNYTNAARYT 
CNAARYTNCA 
GNGARGGNYT 
CNAARMGNGA 
TNWSNMGNAC 
GNAARYTNYT 
TNATGMGNGA 
TYGCNGARTT 
CNGARATHTT 
ARYTNAARAT 
CNACNGGNGA 
CNCARCAYCA 
SNACNWSNTG 
GNAARAA&MG 
GNGAYAARCG 
GGAARCARGA 
AYAARACNGA 
CNWSNAARGG 



YTGGGCNGTN 
&AARWSNCAY 
YATGGARGAR 
YGAYGAYGAR 
HCARAARMGK 
YWSNGAYYTN 
NWSNACNGAR 
NCCNWSNGAY 
RGARGARGAR 
NYTNCAYCCN 
RGCNGCNCAR 
NGAYGGNMGN 
RACNACNAAR 
NATHMGNGCN 
RMGNTTYGAR 
NMGNYTNTGY 
NYTNWSNGCN 
YYTNAARGGN 
YCARGARCAY 
YGG^CCNYTN 
GAARGARYTN 
RCCNCARAAR 
YACNGARGAR 
GACNGGNTTY 
NMGNAARGTN 
KAARGTNAAR 
RAARCARCAR 
RACNMGNTTY 
NGCNCCNYTN 



GCNAARGAYA 
GARWSNAARC 
GARGARAAYG 
GAYGARGARG 
GCNGTNAARM 
GARGARWSNG 
GARCARGARG 
GTNAAYGAKC 
YTNGGNGARY 
GAYACNGARC 
AARTGYYTNY 
CARYTNAARG 
GTNAARAARC 
GGNACNAARG 
YTNYTNAARC 
YTNCAYAAYY 
ACNWSNGGNG 
GTNCAYGGNA 
GARCAYGCNY 
AARMGNCCNA 
MGNATHCARM 
GGNCARCCNG 
CARWSNAARG 
CARACNAARG 
YTNGCNYTNC 
CCNGTNCAYC 
YTNWSNWSNG 
AAYCARYTNG 
GCNAARMGNW 



ARTAYAARGA 
AYCARGARWS 
AYGAYGAYGA 
ARGARAAYAT 
GNCCNGCNCC 
AYWSNATHGA 
AYAARGCNGT 
GNAARACNGT 
TNYTNCARCA 
AYWSNAARGG 
TNGCNGCNWS 
TNGAYYTNGC 
CNACNGGNAC 
CNGCNGARGG 
AYCARAARYT 
TNCCNAARGC 
ARAARGGNGT 
AYATGAARGG 
TNAARGCNYT 
THGTNGARTT 
GNWSNYTNCA 
ARCCNGCNAA 
TNCCNCCNGA 
CNGARGTNGA 
CNWSNCAYMG 
CNAARAARCC 
ARCAROTNWS 
TNGARCARTA 
SNAARTGGTT 



YACNCARWSN 
WGTNAARAAR 
YGAYGAYGAY 
HGARWSNAAR 
NGCNAARWSN 
YGAYGGNGAR 
NCARGTNWSN 
NTTYATHMGN 
RTTYGGNGAR 
NTGYGCNTTY 
NCCNGARAAY 
NGTNACNMGN 
NMGNAAYYTN 
NGTNWSNGCN 
NAARGAYCAR 
NGTNGAYGAY 
NMGNATHAAR 
NCARWSNYTN 
NMGNYTNATH 
YWSNYTNGAR 
RAARATGMGN 
RGAYCARCAR 
RCARAARMGN 
RCARGTNOAR 
NGGNCCNAAR 
NAARCCNCAR 
NMGNAARAAR 
YAARCARAAR 
YGAYWSN 
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(2) INFORMATION FOR SEQ ID NO;3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xij SEQUENCE DESCRIPTION: SEQ ID NO:3: 



soo 

660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2277 
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TGAGCTTGGT 
CGCGTCCTCA 
CGCCTCCCGC 
GTGAAGCAGT 
GTCACTTTTT 
GGTTGCAAGA 
GGGAAAAATG 
GCAGATAAGA 
TTGAAGACAG 
GATGGGAAGA 
GCTCTCAAAG 
GTGGCAAAGG 
CATGAATCTA 
GAGGAAGAAA 
GAAGATGAAG 
AGAGCAGTCA 
CTAGAGGAAA 
GAGGAGCAAG 
GATGTGAATG 
GAACTTGGGG 
CCAGACACAG 
CAGAAATGCC 
CGGCAGCTCA 
AAGGTGAAGA 
GCTGGGACGA 
GAGCTGCTGA 
TGCCTGCACA 
GCTACTAGTG 
GGAGTTCATG 
CACGAGCATG 
CTGAAGAGAC 
TTAAGGATCC 
AAGGGGCAAC 
GAACAAAGCA 
TTCCAGACCA 
GTCCTGGCGC 
AAGCCCGTCC 
CAATTATCGT 



TGTCCTACCA 
CGCGCTTTGG 
CCTCGGCCCG 
GCTTCGTGGT 
CAATGCTGGA 
TCAACGTGAC 
AAAACTCAGA 
AAGCCAGATT 
TATTTGCTCA 
TGCGCGGTTT 
GCATGAACAT 
ATAAATATAA 
AACATCAGGA 
ACGATGATGA 
AGGAAGAGAA 
AGAGACCAGC 
GCGATAGTAT 
AGGATAAAGC 
AAGGGAAAAC 
AGCTTCTCCA 
AGCATTCTAA 
TTCTAGCTGC 
AGGTTGACTT 
AGCCGACTGG 
AGGCTGCAGA 
AGCATCAGAA 
ATCTCCCAAA 
GAGAGAAAGG 
GGAACATGAA 
CCCTGAAAGC 
CAATAGTGGA 
AGCGCAGCTT 
CAGAGCCTGC 
AGGTGCCCCC 
AGGCTGAAGT 
TCCCCTCACA 
ATCCCAAAAA 
CCGAGCAGGT 



AAGCCAGCGT 
GTTTCCCGGT 
CAGTGAGCAG 
GACTGAAAAA 
AGATGTTCAG 
TGTTGCCAAG 
GTGCCCAAAG 
AATTATTCGG 
ATTTGGAGCT 
TGGTTTTGTT 
GAAAGAGATA 
AGATACACAG 
ATCAGTTAAA 
TGACGATGAT 
TATAGAATCA 
CCCTGCAAAA 
TGATGATGGA 
TGTGCAAGTC 
TGTTTTTATC 
ACAGTTTGGA 
AGGTTGTGCA 
TTCTCCAGAG 
GGCGGTGACC 
CACCCGGAAT 
GGGTGTGAGT 
ACTCAAGGAC 
GGCTGTAGAT 
GGTGCGCATC 
GGGTCAGTCC 
CCTCCGCCTC 
GTTCTCTTTA 
GCAAAAAATG 
AAAAGACCAG 
AGAGCAGAAG 
GGAGCAGGTG 
CCGAGGCCCC 
GCCAAAGCCA 
ATCTAGGAAA 



TTCGGCTCGC 
CTCATGGCCG 
CTGGAGGAAC 
GGG AGTAAGG 
AGGGCCCTCA 
AAAAAACTGA 
AAGGAGCCGA 
AACCTGAGCT 
GTCCTGGAAG 
CAGTTCAAAA 
AAAGGCCGGA 
TCTGTTTCTG 
AAGAAGGGCA 
GATGATGAAG 
AAGGTGACCA 
AGCAGTGATC 
GAGGAACTGG 
TCAAACAAAA 
AGAAATCTGT 
GAACTCAAAT 
TTTGCCCAGT 
AATGAGGCTG 
CGTGATGAGG 
CTCTATCTGG 
GCTGCTGATA 
CAGAATATCT 
GACAAACAGC 
AAGGAGTGTA 
CTGGGCTACG 
ATCAACAACA 
GAAGATCGAA 
AGATCCAAGC 
CAACAGAAGG 
AGAAAGGCGG 
GAGCTGCCTG 
AAAATCAGGT 
CAGATAAACC 
AAAGCTAAGG 
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GTGCGCCGGC 
GCCTGACCTT 
TGTTCAGTCA 
CATGTCGAGG 
AGGAGATTAC 
GGAACAAGAC 
AGGCTAAAAA 
TTAAGTGTTC 
TAAATATCCC 
ACCTCCTAGA 
CAGTGGCTGT 
CTATAGGTGA 
GAGAGGAAGA 
AAGATGGGGT 
AGCCTGTGCA 
ATTCTGAGGA 
CTCAGAGTGA 
AGAAGAGGAA 
CCTTTGACTC 
ATGTCCGCAT 
TCATGACTCA 
GTGGGCTTAA 
CTGCAAAGCT 
CCCGAGAAGG 
TGGCCAAAAG 
TTGTCTCCCG 
TCAGAAAGCT 
GAGTGATGCG 
CCTTTGCGGA 
ATCCAGAAAT 
GAAAACTTAA 
CTGCAACTGG 
CAGCTCAACA 
GCTCTACCTC 
ATGGAAAGAA 
TGCGGGACAA 
AGTGGAAGCA 
GAAATAAGAC 



CTAGTTTGCT 
ATTTGTGGGC 
GGTGGGGCCG 
CTTTGGCTAT 
CACCTTTGAA 
AAAGGAAAAG 
AGCCAAAGTG 
AGAAGATGAC 
TAGGAAACCA 
AGGAGGTAAA 
GGATTGGGCC 
GGAAAAGAGC 
GGATATiGGAA 
TTTTGATGAT 
AATTCAGAAG 
GGACAGTGAC 
TACCAGCACT 
ATTACCCTCT 
AGAAGAAGAA 
TGTCTTGCAT 
AGAAGCAGCT 
ACTGGATGGC 
TCAGACGACG 
CTTGATTCGT 
AGAACGGTTT 
AACCAGGCTC 
GCTGCTGAGT 
AGACCTCAAA 
GTTCCAAGAG 
CTTTGGGCCT 
AATGAAGGAA 
TGAGCCTCAG 
CCACACAGAG 
ATGGACCGGG 
GAGAAGAAAG 
AGGCAAAGTG 
GGAGAAGCAG 
GGAAACCCGC 



60 
120 
160 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
13 80 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
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TTCAACCAGC TGGTCGAACA ATATAAGCAG AAATTATTGG GACCTTCTAA AGGAGCACCT 2 3 40 

CTTGCAAAGA GGAGCAAATG GTTTGATAGT TGATGATGGC AGCAGGCTGG GTAAGAAGCT 24 00 

GGGTTGTATA CTTTCTGGTG ACACTCCTGG GCTCCTCCCC ATCCCCCGTG TCTCTCACTG 24 60 

AGGGAAAGAA AATCCCCAAG GGCACTGCCA CTGTGCTCGG AGGTGCCCTG GACTGTGTAC 25 20 

ATCTGAACTT TGGTCCATCC TTTGATGTGT GGTTCGTTAG CCACAAAGAG AAATATCTGA 2500 

AAGTCAACAT GATGCTTCTT GCATATTATC CAGATTATTG TATGAAGTTG TGTCTATAAT 264 0 

TATTACCAAT TTTTATTCTT TATTTCTCAA ATGGAAACAC CTGAAAAAGC AAAAAAAAAA 2700 

AAAAAAAAAA CTCGAGGGGG GCCCGTACCC AAT 2733 



(2) INFORMATION POR SEQ ID N0:4; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2277 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 
<ii> MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

AUGGCUGGUC UGACCCUGUU CGUUGGUCGU CUGCCGCCGU CCGCUCGUUC CGAACAGCUG 60 

GAAGAACUGU UCUCCCAGGU UGGUCCGGUU AAACAGUGCU UCGUUGUUAC CGAAAAAGGU 120 

UCCAAAGCUU GCCGUGGUUU CGGUUACGUU ACCUUCUCCA UGCUGGAAGA CGUUCAGCGU 180 

GCUCUGAAAG AAAUCACCAC CUUCGAAGGU tJGCAAAAUCA ACGUUACCGU UGCUAAAAAA 240 

AAACUGCGUA ACAAAACCAA AGAAAAAGGU AAAAACGAAA ACUCCGAAUG CCCGAAAAAA 300 

GAACCGAAAG CUAAAAAAGC UAAAGUUGCU GACAAAAAAG CUCGUCUGAU CAUCCGUAAC 360 

CUGUCCUUCA AAUGCUCCGA AGACGACCUG AAAACCGUUU UCGCUCAGUU CGGUGCUGUU 420 

CUGGAAGUUA ACAUCCCGCG UAAACCGGAC GGUAAAAUGC GUGGUUUCGG UUUCGUUCAG 4 80 

UUCAAAAACC UGCUGGAAGC UGGUAAAGCU CUGAAAGGUA UGAACAUGAA AGAAAUCAAA 54 0 

GGUCGUACCG UUGCUGUUGA CUGGGCUGUU GCUAAAGACA AAUACAAAGA CACCCAGUCC 600 

GUUUCCGCUA UCGGUGAAGA AAAAUCCCAC GAAUCCAAAC ACCAGGAAUC CGUUAAAAAA 660 

AAAGGUCGUG AAGAAGAAGA CAUGGAAGAA GAAGAAAACG ACCACGACGA CGACGACGAC 720 

GACGAAGAAG ACGGUGUUUU CGACGACGAA GACGAAGAAG AAGAAAACAU COAAUCCAAA 780 

GUUACCAAAC CGGUUCAGAU CCAGAAACGU GCUGUUAAAC GUCCGGCUCC GGCUAAAUCC 840 

UCCGACCACU CCGAAGAAGA CUCCGACCUG GAAGAAUCCG ACUCCAUCGA CGACGGUGAA 900 

GAACUGGCUC AGUCCGACAC CUCCACCGAA GAACAGGAAG ACAAAGOJGU UCAGGUUUCC 960 

AACAAAAAAA AACGUAAACU GCCGUCCGAC GUUAACGAAG GUAAAACCGU UUUCAUCCGU 1020 

AACCUGUCCU UCGACUCCGA AGAAGAAGAA CUGGGUGAAC UGCUGCAGCA GUUCGGUCAA 1080 

CUGAAAUACC UUCCUAUCGU UCUGCACCCG GACACCGAAC AOJCCAAAGG UUGCGCUUUC 1140 

GCUCAGUUCA UGACCCAGGA AGCUGCUCAG AAAUGCCUGC UGGCUGCTUC CCCGGAAAAC 1200 

GAAGCUGGUG GUCUGAAACU GGACGGUCGU CAGCUGAAAG UUGACCUGGC UGUUACCCCU 1260 
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GAC9AAGCUG 


CUAAACUGCA 


GACCACCAAA 


GUUAAAAAAC 


CGACCGGUAC 


CCGUAACCUG 


1320 


UACCUGGCUC 


GUGAAGGUCU 


GAUCCGUGCU 


GGUACCAAAG 


CUGCUGAAGG 


UGUUUCCCCU 


1380 


GCUGACAUGG 


CUAAACGUGA 


ACGUUUCGAA 


CUGCUGAAAC 


ACCAGAAACU 


GAAAGACCAG 


1440 


AACAUCUUCG 


UUUCCCGUAC 


CCGUCUGUGC 


CUGCACAACC 


UGCCGAAAGC 


UGUUGACGAC 


1500 


AAACAGCUGC 


GUAAACUGCU 


GCUGUCCCCU 


ACCUCCGGUG 


AAAAAGGUGU 


UCGUAUCAAA 


1560 


GAAUGCCGUG 


UUAUGCGUGA 


CCUGAAAGGU 


GUUCACGGUA 


ACAUGAAAGG 


UCAGUCCCUG 


1620 


GGUUACGCUU 


UCGCUGAAUU 


CCAGGAACAC 


GAACACGCUC 


UGAAAGCUCU 


GCGUCUGAUC 


1680 


AACAACAACC 


CCGAAAUCUU 


CGGUCCGCUG 


AAACGUCCGA 


UCGUUGAAUU 


CUCCCUGGAA 


1740 


GACCGUCGUA 


AACUGAAAAU 


GAAAGAACUG 


CGUAUCCAGC 


GUUCCCUGCA 


GAAAAUGCGU 


1800 


UCCAAACCGG 


CUACCGGUGA 


ACCGCAGAAA 


GGUCAGCCGG 


AACCGGCUAA 


AGACCAGCAG 


I860 


CAGAAAGCUG 


CUCAGCACCA 


CACCGAAGAA 


CAGUCCAAAG 


UUCCGCCGGA 


ACAGAAACGU 


1920 


AAAGCUGGUU 


CCACCUCCUG 


GACCGGUUUC 


CAGACCAAAG 


CUGAAGUUGA 


ACAGGUUGAA 


1980 


CUGCCGGACG 


GUAAAAAACG 


UCGUAAAGUU 


CUGGCUCUGC 


CGUCCCACCG 


UGGUCCGAAA 


2040 


AUCCGUCUGC 


GUGACAAAGG 


UAAAGUUAAA 


CCGGUUCACC 


CGAAAAAACC 


GAAACCGCAG 


2100 


AUCAACCAGU 


GGAAACAGGA 


AAAACAGGAG 


CUOUCCUCCG 


AACAGCUUUC 


CCGUAAAAAA 


2160 


GCUAAAGGUA 


ACAAAACCGA 


AACCCGUUUC 


AACCAGCUGG 


UUGAACAGUA 


CAAACAGAAA 


2220 


CUGCUaOGUC 


CGUCCAAAGG 


UGCUCCGCUG 


CCUAAACGUU 


CCAAAUGGUU 


CGACUCC 


2277 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 2277 bace pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

25 (xi) SEQUENCE DESCRIPTION; SEQ ID NO: 5: 



ATGGCCGGCC 


TGACCCTGTT 


CGTGCGCCGC 


CTCCCCCCCA 


GCGCCCGCAG 


CGAGCAGCTG 


60 


GAGGAGCTGT 


TCACCCACGT 


GGGCCCCGTG 


AAGCAGTGCT 


TCCTGGTGAC 


CGAGAAGGGC 


120 


AGCAAGGCCT 


GCCGCGGCTT 


CGGCTACGTG 


ACCTTCAGCA 


TGCTGGAGGA 


CGTGCAGCGC 


180 


GCCCTGAAOO 


AGATCACCAC 


CTTCGAGGGC 


TGCAAGATCA 


ACGTGACCGT 


GGCCAAGAAG 


240 


AAGCTGCGCA 


ACAAGACCAA 


GGAGAAGGGC 


AAGAACGAGA 


ACAGCCAGTG 


CCCCAAGAAG 


300 


GAGCCCAAGG 


CCAAGAAGGC 


CAAGGTGGCC 


GACAAGAAGG 


CCCGCCTGAT 


CATCCGCAAC 


360 


CTGAGCTTCA 


AGTGCAGCGA 


GGACGACCTG 


AAGACCGTGT 


TCGCCCAGTT 


CGGCGCCGTG 


420 


CTGGAGGTGA 


ACATCCCCCG 


CAAGCCCGAC 


GQCAAGATGC 


GCGGCTTCGG 


CTTCGTGCAG 


480 


TTCAAGAACC 


TGCTGGAGGC 


CGGCAAGGCC 


CTGAAGGGCA 


TGAACATGAA 


GGAGATCAAG 


540 


GGCCGCACCG 


TGGCCGTGGA 


CTGGCCCGTC 


GCCAAGGACA 


AGTACAAGGA 


CACCCAGAGC 


600 


GTGAGCGCCA 


TCCGCCAGGA 


GAAGAGCCAC 


GAGAGCAAGC 


ACCAGGAGAG 


CGTGAAGAAG 


660 


AAGGGCCGCG 


AGGAGGAGGA 


CATGGAGGAG 


GAGGAGAACG 


ACGACGACGA 


CGACGACGAC 


720 
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Q&GAGGAGG ACGGCGTGTT CGACGACGAG GACGAGGAGG AGGAGAACAT CGAGAGCAAG 780 

GTGACCAAGC CCGTGCAGAT CCAGAAGCGC GCCGTGAAGC GCCCCGCCCC CGCCAAGAGC 84 0 

AGCGACCACA GCGAGGAGGA CAGCGACCTG GAGGAGAGCG ACAGCATCGA CGACGGCGAG 900 

GAGCTGGCCC AGAGCGACAC CAGCACCGAG GAGCAGGACG ACAAGGCCGT GCAGGTGAGC 960 

AACAAGAAGA AGCGCAAGCT GCCCAGCGAC GTGAACGAGG GCAAGACCGT GTTCATCCGC 1020 

AACCTGAGCT TCGACAOCGA GGAGGAGGAG CTGGGCGAGC TGCTGCAGCA GTTCGGCGAG 1080 

CTGAAGTACG TGCGCATCGT GCTGCACCCC GACACCGAGC ACAGCAAGGG' CTGCGCCTTC 114 0 

GCCCAGTTCA TGACCCAGGA GGCCGCCCAG AAGTGCCTGC TGGCCGCCAG CCCCGAGAAC 1200 

GAGGCCGGCG GCCTGAAGCT GGACGGCCGC CAGCTGAAGG TGGACCTGGC CGTGACCCGC 1260 

GACGAGGCCG CCAAGCTGCA GACCACCAAG GTGAAGAAGC CCACCGGCAC CCGCAACCTG 1320 

TACCTGGCCC GCGAGGGCCT QATCCGCGCC GG CACCAAGG CCGCCGAGGG CGTGAGCGCC 13 80 

GCCGACATGG CCAAGCGCGA GCGCTTCGAG CTGCTGAAGC ACCAGAAGCT GAAGGACCAG 1440 

AACATCTTCC TGAGCCCCAC CCGCCTGTGC CTGCACAACC TGCCCAAGGC CGTGGACGAC IS 00 

AAGCAGCTGC GCAAGCTGCT GCTGAGCGCC ACCAGCGGCG AGAAGGGCGT GCGCATCAAG 1560 

GAGTGCCGCG TGATGCGCGA CCTGAAGGGC GTGCACGGCA ACATGAAGGG CCAGAGCCTG 1620 

GGCTACGCCT TCGCCGAGTT CCAGGAGCAC GAGCACGCCC TGAAGGCCCT GCGCCTGATC 1680 

AACAACAACC CCGAGATCTT CGGCCCCCTG AAGCGCCCCA TCGTGGAGTT CAGCCTCGAG 1740 

GACCGCCGCA AGCTGAAGAT GAAGGAGCTG CGCATCCAGC GCAGCCTGCA GAAGATGCGC 1800 

AGCAAGCCCG CCACCGGCGA GCCCCAGAAG GGCCAGCCCG AGCCCGCCAA GGACCAGCAG 1860 

CAGAAGGCCG CCCAGCACCA CACCGAGGAG CAGAGCAAGG TGCCCCCCGA GCAGAAGCGC 1920 

AAGGCCGGCA GCACCAGCTG GACCGGCTTC CAGACCAAGG CCGAGGTGGA GCAGGTGGAG 1980 

CTGCCCOACG GCAAGAAGCG CCGCAAGGTG CTGGCCCTGC CCAGCCACCG CGGCCCCAAG 204 0 

ATCCGCCTGC GCGACAAGGG CAAGGTGAAG CCCGTGCACC CCAAGAAGCC CAAGCCCCAG 2100 

ATCAACCAGT GGAAGCAGGA GAAGCAGCAG CTGAGCAGCG AGCAGGTGAG CCGCAAGAAG 2160 

GCCAAGGGCA ACAAGACCGA GACCCGCTTC AACCAGCTGG TGGAGCAGTA CAAGCAGAAG 222 0 

CTGCTGGGCC CCAGCAAGGG CGCCCCCCTG GCCAAGCGCA GCAAGTGGTT CGACAGC 2277 

(2) INFORMATION FOR SEQ ID KO t 6 t 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 540 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GGGTTGCGGA GGGTGGGCCT GGGAGGGGTG GTGGCCATTT TTTGTCTAAC CCTAACTGAG 60 

AAGGGCGTAG GCGCCGTGCT TTTGCTCCCC GCGCGCTGTT TTTCTCGCTG ACTTTCAGCG 120 

GGCGG AAAAG CCTCGGCCTG CCCCCTTCCA CCGTTCATTC TAGAGCAAAC AAAAAATGTC 180 
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AGCTGCTGGC CCGTTCGCCC CTCCCGGGGA CCTGCGGCGG GTCGCCTGCC CAGCCCCCGA 24 0 

ACCCCGCCTG GAGGCCGCGG TCGGCCCGGG GCTTCTCCGG AGGCACCCAC TGCCACCGCG 30 0 

AAGAGTTGGG CTCTGTCAGC CGCGGGTCTC TCGGGGGCGA GGGCGAGGTT CAGGCCTTTC 3 60 

AGGCCGCAGG AAGAGGAACG GAGCGAGTCC CCGCGCGCGG CGCGATTCCC TGAGCTGTGG 420 

GACGTGCACC CAGGACTCGG CTCACACATG CAGTTCGCTT TCCTGTTGGT GGGGGGAACG 480 

5 CCGATCGTGC GCATCCGTCA CCCCTCGCCG GCAGTGGGGG CTTGTGAACC CCCAAACCTG 540 



(2) INFORMATION FOR SEQ ID NO:7> 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 540 base- pairs 
10 (B) TYPE: nucleic acid 

<C) STRAND EDNESS : double 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO; 7: 
15 GGGTTGCGGA GGGTGGOCCT GGGAGGGGTG GTGGCCATTT TTTGTCCAAC CCCAACTGAG 60 

AAGGGCGTAG GCGCCGTGCT TTTGCTCCCC GCGCGCTGTT TTTCTCGCTG ACTTTCAGCG 120 
GGCGGAAAAG CCTCGGCCTG CCGCCTTCCA CCGTTCATTC TAGAGCAAAC AAAAAATGTC 180 
AGCTGCTGGC CCGTTCGCCC CTCCCGGGGA CCTGCGGCGG GTCGCCTGCC CAGCCCCCGA 240 
ACCCCGCCTG GAGGCCGCGG TCGGCCCGGG GCTTCTCCGG AGGCACCCAC TGCCACCGCG 300 
20 AAGAGTTGGG CTCTGTCAGC CGCGGGTCTC TCGGGGGCGA GGGCGAGGTT CAGGCCTTTC 3 60 

AGGCCGCAGG AAGAGGAACG GAGCGAGTCC CCGCGCGCGG CGCGATTCCC TGAGCTGTGG 420 
GACGTGCACC CAGGACTCGG CTCACACATG CAGTTCGCTT TCCTGTTGGT GGGGGGAACG 4 80 

CCGATCGTGC GCATCCGTCA CCCCTCGCCG GCAGTGGGGG CTTGTGAACC CCCAAACCTG 540 



25 (2) INFORMATION FOR SEQ ID NO: 8; 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: S40 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GGGTTGCGGA GGGTGGGCCT GGGAGGGGTG GTGGCCATTT TTTGTCTAAG CCTAAGTGAG 60 
AAGGGCGTAG GCGCCGTGCT TTTGCTCCCC GCGCGCTGTT TTTCTCGCTG ACTTTCAGCG 12 0 

35 GGCGGAAAAG CCTCGGCCTG CCGCCTTCCA CCGTTCATTC TAGAGCAAAC AAAAAATGTC 180 

AGCTGCTGGC CCGTTCGCCC CTCCCGGGGA CCTGCGGCGG GTCGCCTGCC CAGCCCCCGA 240 
ACCCCGCCTG GAGGCCGCGG TCGGCCCGGG GCTTCTCCGG AGGCACCCAC TGCCACCGCG 3 00 

25 
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AAGAGTTGGG CTCTGTCAGC CGCGGGTCTC TCGGGGGCGA GGGCGAGGTT CAGGCCTTTC 3 60 

AGGCCGCAGG AAGAGGAACG GAGCGAGTCC CCGCGCGCGG CGCGATTCCC TGAGCTGTGG 42 0 

GACGTCCACC .CAGGACTCGG CTCACACATG CAGTTCGCTT TCCTGTTGGT GGGGGGAACG 4 80 

CCGATCGTGC GCATCCGTCA CCCCTCGCCG GCAGTGGGGG CTTGTGAACC CCCAAACCTG 540 



5 (2) INFORMATION FOR SEQ ID NO ; 9 : 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 538 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
10 (D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



GGGTTGCGGA 


GGGTGGGCCT 


GGGAGGQGTG 


GTGGCCATTT 


TTTGTCTACC 


CTACTGAGAA 


60 


GGGCGTAGGC 


GCCGTGCTTT 


TGCTCCCCGC 


GCGCTGTTTT 


TCTCGCTGAC 


TTTCAGCGGG 


120 


CCGAAAAGCC TCCCCCTCCC GCCTTCCACC GTTCATTCTA GAGCAAACAA AAAATGTCAG 


180 


CTGCTGGCCC 


GTTCGCCCCT 


CCCGGGGACC 


TGCGGCGGGT 


CGCCTGCCCA GCCCCCGAAC 


240 


CCCGCCTGGA 


GGCCGCGGTC 


GGCCCGGGGC 


TTCTCCGGAG 


GCACCCACTG 


CCACCGCGAA 


300 


GAGTTGGGCT 


CTGTCAGCCG 


CGGGTCTCTC 


GGGGGCGAGG 


GCGAGGTTCA 


GGCCTTTCAG 


360 


GCCGCAGGAA 


GAGGAACGGA 


GCGAGTCCCC 


GCGCGCGGCG 


CGATTCCCTG 


AGCTGTGGGA 


420 


CGTGCACCCA 


GGACTCGGCT 


CACACATGCA 


GTTCGCTTTC 


CTGTTGGTGG 


GGGGAACGCC 


480 


GATCGTG CGC 


ATCCGTCACC 


CCTCGCCGGC AGTGGGGGCT 


TGTGAACCCC 


CAAACCTG 


538 



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 
(ii) MOLECULE TYPE: peptide 

30 (ix) FEATURE: 

(A) NAME/ KEY: Peptide 

(B) LOCATION: 5. .13 

(D) OTHER INFORMATION; /note- "Xaa represents isoleucine 
or leucine" 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Glu Ala Ala Thr Xaa Asp Xaa Pro Gin Gin Gly Ala Asn Lys 
1 5 io 
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| WHATTS CLAIMED IS; 

1 . An isolated protein comprising a telomcrase protein selected from the group 
consisting of human telomerase protein pl40, pi 05, p48 and p43, having molecular weights 
of about 140kD, about 105kD, about 48kD and about 43kD, respectively, as determined by 

! polyacrylamide gel electrophoresis under denaturing conditions, or a human telomerase 

5 protein domain thereof having telomerase-specific activity. 

2. An isolated protein according to claim I, wherein said protein specifically binds at 
least one of the telomerase RNA of SEQ ID NO:6, a telomerase primer, or a nucleotide 
triphosphate. 

10 

3. An isolated protein according to claim 1, said protein isolated by: 

lysing HeLa cells by dounce homogenization in low-salt buffer to produce cell 
lysates and 

centrifuging said lysates supplemented with 15% glycerol at 125,000 x g for 50 minutes to 
1 5 obtain a first soluble fraction enriched for telomerase (S- 100 fraction); 

binding said S-100 fraction adjusted to 0.2 M ammonium sulfate to SP Sepharose 

(Pharmacia), and developing with a gradient in sodium chloride, to obtain a second soluble 

fraction enriched for telomerase (SP fraction); 

binding said SP fraction adjusted to about 0.3-0.4 M ionic strength to Q Sepharose 
20 (Pharmacia) adjusting said SP fraction, and developing with a gradient in sodium chloride, 

to obtain a third soluble fraction enriched for telomerase (Q fraction); binding said Q 

fraction adjusted to about 0.3-0.4 M ionic strength to phosphocellulose (Whatman), and 

developing in sodium chloride, to obtain a fourth soluble fraction enriched for telomerase 

(PC fraction); 

25 binding said PC fraction adjusted to about 0.3-0.4 M ionic strength to 2'Omethyl 

RNA 

oligonucleotide immobilized on streptavidin agarose (Sigma) and eluting with a 
electrophoresis sample medium comprising 5% (J-mercaptoethanol and 2% Sodium 
Dodecyl Sulfate to obtain a fifth soluble fraction (2'Omcthyl fraction); 
30 separating said 2 ! Omethyl fraction by polyacrylamide gel electrophoresis under 

denaturing conditions to obtain a gel band at a molecular weight of about 140kD, l05kD, 
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48kD or 43kD having telomerase activity; 

excising or cluting said gel band to obtain a human telomerase protein of a 
molecular weight of about 140kD, 105kD, 48kD or 43kD, respectively, as determined by 
polyacrylamide gel electrophoresis under denaturing conditions. 

4. An isolated protein comprising a portion of the amino acid sequence of SEQ ED 
NO: 1 sufficient for telomerase-specific activity. 

5. An isolated protein according to claim 4, wherein said protein comprises a human 
telomerase protein domain selected from the group consisting of an RNA binding domain, a 
telomerase subunit binding domain, and a substrate, agonist, antagonist, chaperone or 
cytoskeletal binding domain. 

6. A recombinant nucleic acid encoding a protein according to claim 1 or 4. 

7. A cell comprising a nucleic acid according to claim 6. 

8. A method of making an isolated telomerase protein, comprising steps: introducing a 
nucleic acid according to claim 6 into a host cell or cellular extract, incubating said host cell 
or extract under conditions whereby said nucleic acid is expressed as a transcript and said 
transcript is expressed as a translation product comprising said protein, and isolating said 
translation product. 

9. An isolated human telomerase protein made by the method of claim 8. 

10. An isolated human telomerase nucleic acid comprising SEQ ID NO:6, or a fragment 
thereof sufficient to specifically hybridize with, or amplify from a nucleic acid having the 
sequence defined by SEQ ID NO:6. 

11. An isolated telomerase nucleic acid according to claim 1 0 comprising at least one of 
SEQ ID NO:6, bases 191-210, bases 245-259, bases 341-369 and bases 381-399. 
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12. A method of screening for an agent which modulates the binding of a human 
telomerase protein to a binding target, said method comprising the steps of: 

| incubating a mixture comprising: 

| an isolated protein according to claim 1, 

; a binding target of said protein, and 

I 

| 5 a candidate agent; 

under conditions whereby, but for the presence of said agent, said protein 
specifically binds said binding target at a reference affinity; 

detecting the binding affinity of said protein to said binding target to determine an 
agent-biased affinity, 

10 wherein a difference between the agent-biased affinity and the reference affinity 

indicates that said agent modulates the binding of said protein to said binding target. 

13. A method according to claim 12, wherein said binding target is a substrate of said 
protein and said reference and agent-biased binding affinity are each detected as the 

1 5 polymerization by said protein of a nucleic acid on said substrate. 

14. An isolated somatically recombined protein receptor which specifically binds a 
protein according to claim 1, wherein said receptor is an antibody or a T-cell antigen 
receptor. . 

20 

15. A method of modulating the binding of a human telomerase or telomerase protein to 
a human telomerase binding target, said method comprising contacting said telomerase or 
telomerase protein with a receptor according to claim 14. 

25 16. A method according to claim 15, wherein said binding target is a substrate of said 

telomerase and said receptor inhibits the polymerization by said telomerase of a nucleic acid 
on said substrate. 

1 7. A method of polymerizing a nucleic acid on a substrate, comprising contacting said 
30 substrate with a telomerase comprising a protein according to claim 1, wherein said 

telomerase comprises a nucleic acid template having a preselected nucleotide sequence. 
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